Using Syntactic Dependency-Pairs Conflation to Improve Retrieval Performance in Spanish

نویسندگان

  • Jesús Vilares
  • Francisco-Mario Barcala
  • Miguel A. Alonso
چکیده

This article presents two new approaches for term indexing which are particularly appropriate for languages with a rich lexis and morphology, such as Spanish, and need few resources to be applied. At word level, productive derivational morphology is used to conflate semantically related words. At sentence level, an approximate grammar is used to conflate syntactic and morphosyntactic variants of a given multi-word term into a common base form. Experimental results show remarkable improvements with regard to classical indexing methods.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards the Development of Heuristics for Automatic Query Expansion

In this paper we study the performance of linguisticallymotivated conflation techniques for Information Retrieval in Spanish. In particular, we have studied the application of productive derivational morphology for single word term conflation and the extraction of syntactic dependency pairs for multi-word term conflation. These techniques have been tested on several search engines implementing ...

متن کامل

Using syntactic dependency - pairs con ationto improve retrieval performance in Spanish ?

This article presents two new approaches for term indexing which are particularly appropriate for languages with a rich lexis and morphology, such as Spanish, and need few resources to be applied. At word level, productive derivational morphology is used to connate semantically related words. At sentence level, an approximate grammar is used to connate syntactic and morphosyntactic variants of ...

متن کامل

On the Usefulness of Extracting Syntactic Dependencies for Text Indexing

In recent years, there has been a considerable amount of interest in using Natural Language Processing in Information Retrieval research, with specific implementations varying from the word-level morphological analysis to syntactic parsing to conceptual-level semantic analysis. In particular, different degrees of phrase-level syntactic information have been incorporated in information retrieval...

متن کامل

Towards the development of heuristics

In this paper we study the performance of linguistically-motivated connation techniques for Information Retrieval in Spanish. In particular, we have studied the application of productive derivational morphology for single word term connation and the extraction of syntactic dependency pairs for multi-word term connation. These techniques have been tested on several search engines implementing di...

متن کامل

An evaluation of conflation accuracy using finite-state transducers

Purpose – To evaluate the accuracy of conflation methods based on finite-state transducers (FSTs). Design/methodology/approach – Incorrectly lemmatized and stemmed forms may lead to the retrieval of inappropriate documents. Experimental studies to date have focused on retrieval performance, but very few on conflation performance. The process of normalization we used involved a linguistic toolbo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002